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(57) Abstract: Con.itructs which include a translationa) 
fusion of 14 amino acids derived from the green fluorescent 
protein are provided for the enhanced expression of proteins. 
Methods for the use of such constructs are also provided. 
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PCTAJSOO/18096 



ENHANCED EXPRESSION OF PROTEINS 



INTRODUCTION 

Technical Field 

This invention relates to the application of genetic engineering techniques. 
Specifically, the invention relates to compositions and methods for enhancing expression 
of proteins in host cells. 

Background 

The levels of expression of proteins is often a limiting factor in the usefiilness of a 
system for commercial purposes. In particular, prokaryotic and eukaryotic expression 
systems do not produce desired protein levels. The expression of proteins in plant ceUs 
has been particularly limited due to low levels of expression of tiie introduced gene. 

As such, tiiere remains a need for compositions and methods which enhance the 
expression of proteins. 

SUMMARY OF THE INVENTION 

The present invention provides nucleic acid sequences useful in enhancing 
expression of a wide variety of genes, both eukaryotic and prokaryotic. In particular, 
translational fusion of fourteen (14) amino acids derived from the green Huorescent protein 
(GFP) to the N-tenninal portion of protein resulted in very high levels of protein 
expression. Constiucts comprising DNA encoding the 14 amino acids of GFP fused to the 
N tenninus of a gene for the enzyme 5-enolpyruvylshikimate-3-phosphate synthase 
(hereinafter referred to as EPS? synthase or EPSPS) are provided. The overexpression of 
the EPSPS enzyme was shown to provide high levels of vegetative tolerance and 
reproductive tolerance to glyphosate. 

The expression constructs for use in this invention generally include a promoter 
region capable of providing for enhanced expression of a DNA sequence, a DNA sequence 
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encoding the 14 amino acid sequence from GFP fused to a DNA sequence encoding a 
protein of interest, and a transcription termination region. 

The promoter region of the present invention is preferably linked to a ribosome 
binding site which provides for enhanced translation of mRNA transcripts. 
5 The expression construct of this mvention is preferably linked to a construct having 

a DNA sequence encoding a selectable marker. Expression of the selectable marker 
allows the identification of cells expressing the marker. 

In one preferred embodiment, the protein of interest is expressed in a plant plastid. 
Constructs which are useful for such expression are described herein and are described in 
10 co-pending application U.S. Serial No. 09/1 13,257 and the continuation-in-part application 
thereof which is being filed on even date herewith. 

For expression in plastids, the vectors for transfer of the construct into a plant cell 
preferably include means for inserting the expression and selection constructs into the 
plastid genome. The vectors preferably comprise regions of homology to the target plastid 
15 genome which flank the constructs. In addition, the constructs for expression in plastids 
preferably comprise a promoter sequence linked to a ribosome binding site capable of 
enhancing the translation of mRNA transcripts in the plant plastid. The ribosome binding 
site is preferably derived from the T7 bacteriophage gene 10 leader sequence. 

The translational fusion of 14 amino acids from GFP can be used to enhance the 

2 0 expression level of any enzyme or protein for which the nucleic acid sequence is known. 

However, of panicular interest is the high level expression of nucleic acid sequences 
encoding for enzymes involved in herbicide tolerance and encoding pharmaceuUcal 
proteins. 

In one preferred embodiment, the constructs of the present invention comprise a 
2 5 DNA sequence encoding 5-Enolpyruvylshikimate-3-phosphate synthase (USPN 5,633,435, 
the entirety of which is incorporated herein by reference), nitrilase, phytoene desaturase, 
aprotinin or a DNA sequence encoding human growth hormone (USPN 5,424.199, the 
entirety of which is incorporated herein by reference) It is especially preferred that the 
construct comprise a DNA sequence encoding the EPSPS gene from Agrobacterium strain 

3 0 CP4 (U.S. Patent No. 5,633,435). 

Hosts cells, including plant plastids, containing the constructs are also 
contemplated in the invention. In addition, plants, plant seeds, plant cells or progeny 
thereof containing plastids comprising the construct are contemplated. 
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The present invention also includes methods for enhanced expression of DNA 
sequences in host cells. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 provides the nucleic acid sequence for the Prm/GIOL promoter/RBS 
hybrid. The Prrn promoter contains the consensus plastid -35 and -10 promoter elements 
(underlined) and the transcription start sites (GC in bold) for the Plastid-Encoded RNA 
Polymerase (PEP). The gene 10 leader (GIOL) contains a perfect plastid ribosome binding 
1 0 site (RBS, nucleotides in bold). 

Figure 2 provides maps of constructs pMON30123, pMON30130, pMON38773, 
pMON38798, pMON45259, pMON49218, pMON45201 and pMON45204. As shown, 
^ constructs pMON45259 and pMON49218 include the 14 amino acids of the green 
1 5 fluorescent protein. 

Figure 3 provides detailed maps of constructs pMON45259 and pMON49218. 

Figure 4 provides the nucleic acid sequence for the Prm/NEP/G10L::14aaGFP 
2 0 fusion. The NEP promoter region is underlined (A in bold is transcription start site). The 
NEP promoter region used extends beyond the consensus sequence both upstream and 
downstream of the promoter. The initial ATG, the initiator methionine is not counted in 
the 14 amino acids of GFP. 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to the transiational fusion of fourteen (14) 
amino acids derived from the green fluorescent protein (GFP) to the N-terminal portion of 
a protein otinterest;. -The-fourteen amino acids of-GFP and the-polynucleotide- sequence ^ 
which encodes these amino acids are set forth below: 

GGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA AGC 

ATG 

Gly Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ser 

Met 



It is contemplated that sequences encoding portions of the first 14 amino acids of 
GFP can also be used in the methods of the present invention. In particular, sequences 
15 encoding between the first 5 and the first 14 amino acids of GFP, preferably sequences 

encoding between the first 5 and 10 amino acids can also be used in the present invention. 

The fusion resulted in very high levels of protein expression. As described in 
Example 3. constructs comprising DNA encoding the 14 amino acids of GFP fused to the 
N terminus of the gene encoding the CP4 protein were prepared. These constructs were 

2 0 used to express the protein in plastids. The overexpression of the EPSPS enzyme was 

shown to provide high levels of vegetative tolerance and reproductive tolerance to 
glyphosate. In particular, the fusion protein resulted in an approximately 50 fold increase 
in protem accumulation as compared to a construct which did not include the 14 amino 
acid segment. 

25 As discussed in detail in Example 3, the constructs which included the 14 amino 

acids resulted in high glyphosate tolerance, approximately 128 ounces/acre vegetative and 
approximately 64 ounces/acre reproductive, of transplastomic plants. These levels are 
significantly higher that the levels obtained without the 14 amino acid fusion. 

The expression constructs for use in this invention generally include a promoter 

3 0 region capable of providing for enhanced expression of a DNA sequence, a DNA sequence 

encoding the 14 amino acid sequence from GFP fused to a DNA sequence encoding a 
protein of interest, and a transcription termination region. 
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In a preferred embodiment of the invention, the expression constructs are plastid 
expression constructs and include a promoter and transcription termination region which 
are functional in a plant plastid. In addition, the constructs can include a ribosome binding 
site derived from the T7 bacteriophage polymerase gene 10 leader (GIOL) (see USPN 
5 5,232,840, the entirety of which is incorporated herein by reference), which enhances 
expression of nucleic acid sequences in prokaryotes. These elements are operably joined 
components in the 5' to 3' direction of transcription. 

In developing the constructs, the various fragments comprising the regulatory 
regions and open reading frame can be subjected to different processing conditions, such 
10 as ligation, restriction enzyme digestion, PCR, in vitro mutagenesis, linkers and adapters 
addiUon, and the hke. Thus, nucleotide transitions, transversions, insertions, deletions, or 
the like, may be performed on the DNA which is employed in the regulatory regions or the 
DNA sequences of interest for expression in the plastids. Methods for restriction digests, 
Klenow blunt end treatments, ligations, and the like are well known to those in the art and 
15 are described, for example, by Maniatis et al. (in Molecular Cloning: A Laboratory 
Manual (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY). 

During the preparation of the constructs, the various fragments of DNA will often 
be cloned in an appropriate cloning vector, which allows for amplification of the DNA, 
modification of the DNA or manipulation of the DNA by joining or removing sequences, 
linkers, or the like. Preferably, the vectors will be capable of replication to at least a 
relatively high copy number in E. coli. A number of vectors are readily available for 
cloning, including such vectors as pBR322, vectors of the pUC series, the Ml 3 series 
vectors, and pBluescript vectors (Stratagene; La Jolla, CA). 

In order to provide a means of selecting the desired plant cells, vectors for plastid 
2 5 transformation typically contain a construct which provides for expression of a selectable 
marker gene. Marker genes are plant-expressible DNA sequences which express a 
polypeptide which resists a natural inhibition by, attenuates, or inactivates a selective 
substance, including, but not limited to, antibiotic, herbicide etc. 

Alternatively, a marker gene may provide some otiier visibly reactive response. j.e., 
may cause a distinctive appearance or growtii pattern relative to plants or plant cells not 
expressing the selectable marker gene in the presence of some substance, either as applied 
directly to the plant or plant cells or as present in the plant or plant cell growth media. 
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In either case, the plants or plant cells containing such selectable marker genes will 
have a distinctive phenotype for purposes of identification, i.e., they will be 
distinguishable from non-transformed cells. The characteristic phenotype allows the 
identification of cells, cell groups, tissues, organs, plant parts or whole plants containing 
5 -rthe'eobstnictr^''"^^ — .- ^ ^. 

Detection of the marker phenotype makes possible the selection of cells having a 
second gene to which the marker gene has been linked. This second gene typically 
comprises a desirable phenotype which is not readily identifiable in transformed cells, but 
which is present when the plant cell or derivative thereof is grown to maturity, even under 

1 0 conditions wherein the selectable marker phenotype itself is not apparent. 

The use of such a marker for identification of plant cells containing a plastid 
construct has been described by Svab et al. (1993, supra). In the examples provided 
below, a bacterial aadA. gene is expressed as the marker under the regulatory control of 
chloroplast 5' promoter and 3' transcription termination regions, specifically the regulatory 

15 regions of thepjfcA gene (described in Staub et al.,EMBO 7.(1993) 12(2):601-606). 
Numerous additional promoter regions can also be used to drive expression of the 
selectable marker gene, including various plastid promoters and bacterial promoters which 
have been shown to function in plant plastids. 

Expression of the aadA gene confers resistance to spectinomycin and streptomycin, 

20 and thus allows for the identification of plant cells expressing tiiis marker. The aadA gene 
product allows for continued growth and greening of cells whose chloroplasts comprise the 
selectable marker gene product. CeUs which do not contain tiie selectable marker gene 
product are bleached. Selection for the aadA gene marker is thus based on identification 
of plant cells which are not bleached by tiie presence of streptomycin, or more preferably 

25 spectinomycin, in the plant growth medium. 

A number of markers have been developed for use witii plant cells, such as 
resistance to chloramphenicol, the aminoglycoside G4 1 8, hygromycin, or the like. Other 
genes which encode a product involved in chloroplast metabolism may also be used as 
selectable markers. For example, genes which provide resistance to plant herbicides such 

30 as glyphosate, bromoxynil or imidazolinone may find particular use. Such genes have 
been reported (Stalker et al, J. Biol. Chem. (1 985) 260:4724-4728 (glyphosate resistant 
EPSP); Stalker et al, J. Biol Chem. (1985) 265:63 10-63 14 (bromoxynU resistant nitrilase 
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gene); and Sathasivan et aL, NucL Acids Res, (1990) 75:2188 (AHAS imidazolinone 
resistance gene)). 

Stable transformation of tobacco plastid genomes by particle bombardment is 
reported (Svab et aL (1990), supra) and Svab et aL (1993), supra). The methods 
5 described therein may be employed to obtain plants homoplasmic for plastid expression 
constructs. 

Generally, bombarded tissue is cultured for approximately 2 days on a cell 
division-promoting media, after which the plant tissue is transferred to a selective media 
containing an inhibitory amount of the particular selective agent, as well as the particular 

10 hormones and other substances necessary to obtain regeneration for that particular plant 

species. Shoots are then subcultured on the same selective media to ensure production and 
selection of homoplasmic shoots. 

Transplastomic tobacco plants are analyzed for a pure population of transformed 
plastid genomes (homoplasmic lines). Homoplasmy is verified using Southern analysis 

15 employing nucleic acid probes spanning a region of the transgene and chloroplast genome 
(i.e. the insertion region). Transplastomic plants which are heteroplasmic (i.e. contain a 
mixture of plastid genomes containing and lacking the transgene) are characterized by a 
hybridization pattern of wild type and transgenic bands. Homoplasmic plants show a 
hybridization pattern lacking the wild type band. 

2 0 Alternatively, homoplasmy may be verified using the polymerase chain reaction 

(PGR). PGR primers are utilized which are targeted to amplify from sequences from the 
insertion region. For example, a pair of primers may be utilized in a PGR reaction. One 
primer amplifies from a region in the transgene, while the second primer amplifies from a 
region proximal to the insertion region towards the insertion region. A second PGR 

2 5 reaction is performed using primers designed to amplify the region of insertion. 

Transplastomic lines identified as homoplasmic produce the expected size fragment in the 
first reaction, while they do not produce the predicted size fragment in the second reaction. 

Where transformation and regeneration methods have been adapted for a given 
plant species, either hy Agrobacterium-mtdizicd transfonnation, bombardment or some 

3 0 other method, the established techniques may be modified for use in selection and 

regeneration methods to produce plastid-transformed plants. For example, the methods 
described herein for tobacco are readily adaptable to other solanaceous species, such as 
tomato, petunia and potato. 
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For transformation of soybean, particle bombardment as well as Agrobacterium- 
mediated nuclear transformation and regeneration protocols have been described (Hinchee 
et al. USPN 5,416,01 1, and Christou et al. USPN 5,015,580). The skilled artisan will 
recognize that protocols described for soybean transformation may be used 

Brassica, Agrobacterium-msdiated transformation and regeneration protocols 
generally involve the use of hypocotyl tissue, a non-green tissue which might contain a low 
plastid content. Thus, for Brassica, preferred target tissues would include microspore- 
derived hypocotyl or cotyledonary tissues (which are green and thus contain numerous 
plastids) or leaf tissue explants. While the regeneration rates from such, tissues may be 
low, positional effects, such as seen with Agrobacterium-mtdisXQd transformation, are not 
expected, thus it would not be necessary to screen numerous successfully transformed 
plants in order to obtain a desired phenotype. 

For cotton, transformation of Gossypium hirsutum L. cotyledons by co-cultivation 
with Agrobacterium tumefaciens has been described by Firoozabady et al.. Plant Mol. Bio. 
(1987) 70: 105-1 16 and Umbeck et al, Bio/Technology (1987) 5:263-266. Again, as for 
Brassica, this tissue may contain insufficient plastid content for chloroplast 
transformation. Thus, as for Brassica, an alternative method for transformation and 
regeneration of alternative target tissue containing chloroplasts may be desirable, for 
instance targeting green embryogenic tissue. 

Other plant species may be similarly transformed using related techniques. 
Alternatively, microprojectile bombardment methods, such as described by Klein et al. 
{Bio/T echnology 70:286-291) may also be used to obtain nuclear transformed plants 
comprising the viral single subunit RNA polymerase expression constructs described 
herein. Cotton transformation by particle bombardment is reported in WO 92/15675, 
published September 17, 1992. Suitable plants for the practice of the present invention 
include, but are not limited to, soybean, cotton, alfalfa, oil seed rape, flax, tomato, sugar 
beet, sunflower, potato, tobacco, maize, wheat, rice and lettuce. 

The vectors for use in plastid transformation preferably include means for 
providing a stable transfer of the plastid expression construct and selectable marker 
constnict into the plastid genome. This is most conveniently provided by regions of 
homology to the target plastid genome. The regions of homology flank the construct to be 
transferred and provide for transfer to the plastid genome by homologous recombination, 
via a double crossover into the genome. The complete DNA sequence of the plastid 
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genome of tobacco has been reported (Shinozaki et al, EMBO 7. (1986) 5:2043-2049). 
Complete DNA sequences of the plastid genomes from liverwort (Ohyama^/ a/.. Nature 
(1986) 522:572-574) and rice (Hiratsuka et al, Mol Gen, Genet. (1989) 277:185-194), 
have also been reported. 

" 5 -~Where:the regions. of;homology are present in^the inverted repeat regions of the 

plastid genome (known as IRA and IRB), two copies of the transgene are expected per 
transformed plastid. Where the regions of homology are present outside the inverted 
repeat regions of the plastid genome, one copy of the transgene is expected per 
transformed plastid. The regions of homology within the plastid genome are 

1 0 approximately Ikb in size. Smaller regions of homology may also be used, and as little as 
100 bp can provide for homologous recombination into the plastid genome. However, the 
frequency of recombination and thus the frequency of obtaining plants having transformed 
plastids decreases with decreasing size of the homology regions. 

Examples of constructs having regions of homology within the plastid genome are 

15 described in Svab et.al (1990 supra), Svab et al. (1993 supra) and Zoubenko et al (Nuc 
Acid Res (1994) 22(19):3819-3824). 

Furthermore, the constructs of the present invention may also include sequences to 
target the expressed protein to a particular suborganellar region, for example, the thylakoid 
lumen of the chloroplast. Such targeting of expressed proteins may provide for a 

2 0 compartmentalization of the protein allowing for increased oxidative stability and proper 

protein folding. 

It is contemplated that the 14 amino acids from the GFP can be fused to any protein 
for which the nucleotide sequence has been identified and used in the constructs of the 
present invention to obtain increased protein accumulation. However, it is particularly 
25 contemplated that the protein will confer herbicide tolerance or will be a human biological 
protein (pharmaceutical protein), the sequences of which are well known in the art. 

In a preferred embodiment, the 14 amino acids from the GFP are fused to the 
terminus of EPSP synthase. EPSP synthase genes are used to produce glyphosate tolerant 
plants. A gene for EPSP synthase has been cloned from Agrobacterium tumefaciens sp 

3 0 strain CP4 (USPN 5,633,435). Additional glyphosate tolerant EPSPS genes are described 

in U.S. Patent Number 5,627,061 , Padgette et al (1996) Herbicide Resistant Crops, Lewis 
Publishers, 53-85, and in Penaloza- Vazquez, et al (1995) Plant Cell Reports 14:482-487, 
the entireties of which are incorporated herein by reference. 
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In addition to engineering plants for glyphosate tolerance, plants have also been 
engineered to tolerate other classes of herbicides such as halogenated 
hydroxybenzonitriles, and norflurazon. Halogenated hydroxybenzonitriles, such as 
Bromoxynil, are suggested to act herbicidally by inhibiting the quinone-binding protein 
conipIexiDf photosysteih^II, inhibiting electron transfer ^ Van Rensen (1982) Physiol Plant 
54:515-520,and Sanders and Pallett (1986) Pestic, Biochem, Physiol 26:116-122). 
Herbicides such as norflurazon inhibit the production of carotenoids. Plants which are 
resistant to Bromoxynil have been produced by expressing DNA sequences encodmg for 
enzymes capable of detoxifying Bromoxynil (nitrilases) in the plant cell nucleus. DNA 
sequences encoding for such nitrilases have been cloned from bacteria such as Klebsiella 
pneumoniae and used to construct vectors to direct the expression of the DNA sequence in 
plant cell nucleus (USPN 4,810,648, the entirety of which is incorporated herein by 
reference). Plants which are resistant to Norflurazon have been engineered by expressing 
nucleic acid sequences which encode for enzymes in the carotenoid biosynthetic pathway 
in plant cell nuclei. For example, expressing a phytoene desaturase from Erwinia 
uredovora provides tolerance to norflurazon. 

In addition, constructs can be prepared which include a translational fusion of the 
fourteen amino acids from GFP and a gene, including but not limited to the bar gene for 
tolerance to glufosinate (DeBlock, et al (1987) EMBO 7. 6:2513-2519); stress tolerance 
genes, for example insect or disease resistance/tolerance genes, including the gene 
encoding the Bacillus thuringensis cryl Ac protem; genes encoding human biological 
proteins including aprotinin, human growth hormone, insulin and insulin precursors. 

An artisan skilled in the art to which the present invention pertains will recognize 
that additional sequences can be fused to the fourteen amino acids from GFP to increase 
the expression of these proteins. 

It is also contemplated that the translational fusion of the fourteen amino acids 
from GFP and a protein of interest can be used in any known expression system to increase 
the expression of the protein. Expression systems for use in the present invention can 
include prokaryotic and eukaryotic systems. Such expression systems can include, but are 
not limited to. ftingal expression systems, bacterial expression systems, mammahan cell 
expression systems, and plant expression systems. 

Thus, the constructs and methods of the present invention provide a means for 
obtaining transplastomic plants with high level of protein expression and of tolerance of 
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herbicides. High levels of tolerance include tolerance of vegetative tissue when amounts 
of greater than about 64 oz/acre, most preferably greater than about 1 28 oz/acre. 
Furthermore, high levels of tolerance can also include tolerance of reproductive tissues 
when amounts of greater than about 32 oz/acre, most preferably greater than about 64 

High levels of protein expression include levels of protein expression of greater 
than about 10 percent total soluble protein, and preferably greater than about 12 percent of 
the total soluble protein of a host ceU. 

The invention now being generally described, it will be more readily understood by 
0 reference to the following examples which are included for purposes of illustration only 
and are not intended to limit the present invention. 



EXAMPLES 

15 Example 1 Expression Constructs 

Constructs and methods for use in transforming the plastids of higher plants are 
described in Zoubenko et al (Nuc Acid Res (1994) 22(19):38 19-3824), Svab et al. {Proc 
Natl. Acad. Sci.(1990) 87:8526-8530 and Proc. Natl. Acad. 5c/.(1993) 90:913-917) and 
Staub et al. (EMBO J. (1 993) 1 2:601-606). Constructs and methods for use in 

2 0 transforming plastids of higher plants to express DNA sequences under the control of a 
nuclearly encoded, plastid targeted T7 polymerase are described in U.S. Patent Number 
5,576,198. The complete DNA sequences of the pla.stid genome of tobacco are reported 
by Shinozaki et al. (EMBO J. (1986) 5:2043-2049). All plastid DNA references in the 
following description are to the nucleotide number from tobacco. 

2 5 The complete nucleotide sequence encoding the tobacco cytochrome/ (pe/A)is 

described in Bassham et al, (1991) J Biol Chem 266:23606-23610 and Konishi etal. 
(1993) Plant Cell Physiol 34:1081-1087. 



lA. Promoter/Ribosome Binding Site Sequences 

The promoter region of the plastid 16S ribosomal RNA operon (Prm) is linked to a 
synthetic ribosome binding site (RBS) patterned on the plastid rbcL gene leader to create 
the.Prm/rbcLRBS fragment. The Prm/rbcLRBS sequence is as described in Svab et al. 
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(1993, supra) for the Pr/7z/rbcL(S) fragment. Figure 1 provides the nucleic acid sequence 
for the Prm/GIOL promoter/RBS hybrid. 

The promoter region of the plastid psbA promoter (PpsbA) and terminator 
sequences (TpsbA) are described in Staub et al. (1993. EMBO J., 12, 601-606). 

-The Prm/GIOL sequence was constructed by annealing two oligonucleotide 
sequences, T71eadl and T71ead2 (Table 1), to create the GIOL plastid ribosome binding 
site (Figure 1). The GIOL sequence was ligated to the 3' terminus of the ¥rm promoter 
sequence as an EcdKUNcol fragment to create the Prm/GIOL sequence. 

Table 1 



T71eadl 5 '-AAT TGT AGA AAT AAT TTT GTT TAA CTT TAA GAA GGA 

GAT ATA CC-3' 

T71ead2 5 ' -CAT GGG TAT ATC TCC TTC TTA AAG TTA AAC AAA ATT 
ATT TCT AC-3' 

Chimeric genes are preferably inserted mto the expression vector to direct their 
transcripUon from the Prm promoter. Thus, in the plastid genome, chimeric genes are 
transcribed from the Prm/RBS promoter, or the Prm/GIOL promoter in the plant plastid. 

IB. CP4 EPSPS Plastid Expression Constructs 

The expression vectors used in the Examples are described below and in the maps 
provided in Figure 2. 

A plastid expression vector pMON301 17 is constructed from a precursor vector 
pPRVl 1 IB (Zoubenko. et al. 1994, supra, GenBank accession U12813). The vector 
pMON30117 carries a multiple cloning site for insertion of a passenger gene in a 
Prm/rbcLRBS/Tr/>5] 6 expression cassette. The Prm/rbcLRBS sequence is cloned into 
pPRVl 1 IB vector as an EcoJWNcol fragment, and the terminator region from the plastid 
rpsl6 gene(Trpil6) is cloned 3' of the Prm promoter as a Hindm/Ncol fragment. The 
Trpsie fragment comprises the ;77sl6 gene 3 '-regulatory region from nucleotides 5,087 to 
4,939 in the tobacco plasmid DNA. 

The pPRVl 1 IB backbone of the vector pMON301 17 contains a marker gene, 
aadA. for selection on specdnomycin and streptomycin, and rps 7/12 for the integration, 
by homologous recombination, of the passenger DNA into trnW-rpsVU intergenic region. 
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The pJastid expression construct pMON30] 18 was prepared by cloning the native 
CP4 EPSPS gene fused with the N-terminal five (5) amino acids from the plastid rbcL 
(described in Svab et al., 1993 supra) gene as an NcoVSmal fragment into the multiple 
cloning site of the vector pMON301 17. 

The plastid expression construct pMON30123 is essenually the same as 
PMON301 18 with the exception of the deletion of the N-terminal five (5) amino acids 
from the plastid rbcL. 

The plastid expression construct pMON30130 was created by replacing the native 
CP4 EPSPS of PMON30123. with a synthetic CP4 gene. This construct also lacks the N- 
terminal 5 amino acid fusion from the plastid rbcL gene. 

The plastid expression constnict pMON38773 was constructed by replacing the 
Pr/71/RBS sequence of pMON30123 with the Prnj/GIOL promoter sequence described 
above. The EPSPS DNA sequence of pMON38773 also lacks the N-terminal 5 amino acid 
fusion from the plastid rbcL gene. 

A plastid expression construct, pMON38766 was constructed using the promoter 
from T7 phage gene 10 (P-T7). including GIOL. CP4 (native) gene coding region, and the 
terminator sequence from plastid ipsl6 gene (Trpsl6). 

A plastid expression construct, pMON38797 was constructed using the promoter 
from T7 phage gene 10 (P-T7), including GIOL. CP4 (synthetic) gene coding region, 
terminator from plastid rpsl6 gene (Trpsl6). 

A plastid expression construct. pMON38798 was constiricted using the promoter 
of the leSrDNA operon (Prm), GIOL, CP4 (synthetic) gene coding region, terminator 
from plastid rps 1 6 gene (Trps 16). 

A plastid expression construct, pMON38793 was constructed using the promoter 
of the 16SrDNA operon (Prm). a synthetic ribosome binding site (RBS) patterned from the 
plastid rbcL gene, the glyphosate tolerant Petunia EPSP synthase gene (P-EPSPS, 
Padgette, et al{mi) Arch. Biochem. Biophys. 258:564-573) carrying the mutation 
Glycine to Alanine at amino acid position 101, terminator from plastid rpsl6 gene 
(Trps 16). 

A plastid expression construct, pMON38796 was constructed using the promoter 
of the leSrDNA operon (Prm). synthetic ribosome binding site (RBS) patterned from the 
plastid rbcL gene, the glyphosate tolerant Achromobacter (sQ-ain LBAA) EPSP synthase 
gene (U.S. Patent Number 5.627,061, the entirety of which is incoiporated herein by 
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reference) carrying the mutation Glycine to Alanine at amino acid position 100, terminator 
from plastid rpsl 6 gene (Trpsl6). 

A plastid expression construct, pMON45204, was constructed using the promoter 
of the 16SrDNA operon (Prm) with the GIOL, the glyphosate tolerant Pseudomonas (strain 
5 LB A A)^EPSP-synthase-gene-carrying the mutation .Glycine to Alanine at amino acid 
position 100, terminator from plastid rpsl 6 gene (Trpsl6). 

A plastid expression construct, pMON45201, was constructed using the promoter 
of the 16SrDNA operon (Prm), synthetic ribosome binding site (RBS) patterned from the 
plastid rbcL gene, wild-type glyphosate tolerant BaciEus subtilis aroE (EPSPS) (U.S. 
10 Patent Number 5,627,061) gene, terminator from plastid rpsl6 gene (Trpsl6). 

A plastid expression construct, pMON45259, was constructed using the promoter 
of the leSrDNA operon (Prm) with the GIOL sequence functionally associated with the 
nucleic acid sequence encoding the synthetic CP4 protein having an additional sequence at 
the N-terminus encoding the first 14 amino acids of the green fluorescent protein (GFP) 
15 (GKGEELFTGVVPSM). The sequence encoding the 14 amino acid GFP fusion begins at 
the glycine in the second position of the protein. The construct also contains the rpsl 6 
terminator. A detailed map of expression construct pMON45259 is set forth in Figure 3. 

Another plastid expression construct, pMON49218, was constructed to express the 
synthetic CP4 sequence with the 14 amino acid GFP fusion from the promoter region of 
20 the 1 6SrDNA operon having the nuclear-encoded RNA polymerase region 

(PrmPEP+NEP), and the terminator region from the plastid rpsl 6 gene. A detailed map of 
expression construct pMON49218 is provided in Figure 3 and the DNA sequence of the 
Prm/NEP/G lOL: : 14aaGFP fusion is provided in Figure 4. 

25 Example 2 Plant Transformation 

2A. Nuclear Transformation 

Tobacco plants transformed to express the constructs pWRG4744 and pWRG4747 
in the nucleus of a plant cell can be obtained as described by Horsch et al. {Science (1985) 
227:1229-1232), 

30 

2B. Plastid Transformation 
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Tobacco plastids are transformed by particle gun delivery of microprojectiles as 
described by Svab and Maliga {Proc, Natl Acad. ScL (1993) 90:913-917), and described 
herein. 

Dark green, round leaves are cut, preferably from the middle of the shoots, from 3- 
5 6 week old Nicotima tabdcum cVTHSvanr v7hi*cH have bera ina^^^^^ on 
hormone free MS medium (Murashige and Skoog, (1962) Physiol Plant, 15, 473-497) 
supplemented with B5 vitamins in Phyiatrays or sundae cups with a 16 hour photoperiod at 
24°C. Each cut leaf is then placed adaxial side up on sterile filter paper over tobacco shoot 
regeneration medium (TSO medium: MS salts, lmg/1 ;V^-benzyladenine, O.lmg/1 1- 

10 naphthaleneacetic acid, 1 mg/1 thiamine, lOOmg/1 inositol, 7g/l agar pH 5.8 and 30g/l 

sucrose). Leaves are preferably placed in the center of the plate with as much contact with 
the medium as possible. The plates are preferably prepared immediately prior to use, but 
may be prepared up to a day before transformation by particle bombardment by wrapping 
in plastic bags and storing at lA^'C overnight. 

15 Tungsten or gold particles are sterilized for use as microcarriers in bombardment 

experiments. Particles (50mg) are sterilized with 1 ml of 100% ethanol, and stored at 
-20**C or -80°C. Immediately prior to use, particles are sedimented by centrifugation, 
washed with 2 to 3 washes of 1 ml sterile deionised distilled water, vortexed and 
centrifuged between each wash. Washed particles are resuspended in 500 |il 50% glycerol. 

2 0 Sterilized particles are coated with DNA for transformation. Twenty-five 

microliter aliquots of sterilized particles are added to a 1 .5 ml microfuge tube, and 5 of 
DNA of interest is added and mixed by tapping. Thirty-five microliters of a freshly 
prepared solution of 1.8M CaCh and 30 mM spermidine is added to the particle/DNA 
mixture, mixed gently, and incubated at room temperature for 20 minutes. The coated 

2 5 particles are sedimented by centrifuging briefly. The particles are washed twice by adding 
200)Xl 70% ethanol, mixing gently, and centrifuging briefly. The coated particles are 
resuspended in 50^1 of 100% ethanol and mixed gently. Five to ten microliters of coated 
particles are used for each bombardment. 

Transformation by particle bombardment is carried out using the PDS 1000 Helium 

30 gun (Bio Rad, Richmond, CA) using a modified protocol described by the manufacturer. 

Plates containing the leaf samples are placed on the second shelf from the bottom 
of the vacuum chamber and bombarded using the 1 100 p.s.i. rupture disk. After 

15 
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bombardment, petxiplates containing the leaf samples are wrapped in plastic bags and 
incubated at 24°C for 48 hours. 

After incubation, bombarded leaves are cut into approximately 0.5 cm^ pieces and 
placed abaxial side up on TSO medium supplemented with 500 ^g/ml spectinomycin. 
5^ "*Aftef 3 to^41\re^kroirthe-yele 

will appear on the leaf tissue. These shoots will continue to grow on spectinomycin 
containing medium and are referred to as primary putative transformants. 

When the primary putative transformants have developed 2 to 3 leaves, 2 small 
pieces (approximately 0.5 cm^) are cut from each leaf and used for either selection or for a 

1 0 second round of shoot regeneration. One piece is placed abaxial side up on plates 

containing TSO medium supplemented with 500 ^ig/ml spectinomycin, and the other piece 
is placed abaxial side up on TSO medium supplemented with 500 ^ig/ml each of 
spectinomycin and streptomycin. Positive transformants are identified as the shoots which 
form green callus on the TSO medium containing spectinomycin and streptomycin. 

1 5 After 3 to 4 weeks, the tissue placed on TSO medium containing only 

spectinomycin, which has been identified as positive on the TSO medium with 
spectinomycin and streptomycin, will develop green shoots. Two to four shoots of each 
positive transfonnant are selected and transferred to TSO medium supplemented with 500 
^ig/ml spectinomycin for generation of roots. Southern analysis is performed on 2 shoots 

20 to confirm homoplasmy as described below. Shoots from homoplasmic events are 
transferred to the greenhouse for seed production, while transformants which are not 
homoplasmic are sent through a second round or regeneration on TSO medium with 500 
|ag/ml spectinomycin to attain homoplasmy. 

2 5 Example 3 Analysis of Transplastomic Tobacco Plants Transformed with Herbicide 

Tolerance Constructs 

3A. Western Blot Analysis of Tobacco CP4 EPSPS 

To determine the expression of the EPSPS Western blot analysis was performed on 
a single line from each construct, pMON30123, pMON30130, pMON38773, 

3 0 pMON38798, pMON45259 and pMON4921 8. 

Total soluble protein was extracted from frozen leaf tissue by grinding 250 mg 
tissue in 250^1 of PBS buffer (1 mM KH2PO4, Na2HP04, 0.1 37M NaCl, 2.7 mM KCl pH 
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7.0) containing protease inhibitors. The homogenate is centrifuged for 5 minutes, and the 
supernatant is transferred to a fresh tube. The concentration of the protein in the 
supernatant is determined using a protein concentration assay (BioRad, Richmond, CA). 

Extracted total protein is electrophoresed on a 4-20% SDS-PAGE gel (Sigma, St 
-Louis,-M0)7 and-transferred to PVDF membrane in Ix SDS-PAGE buffer (Maniatis et al 
1989, Cold Spring Harbor Press). Standards of quantitated purified CP4 EPSPS protein 
were used to quantify the expression of the CP4 EPSPS as expressed in the plant plastid. 

Western hybridizations are performed as described in Staub and Maliga (1993) 
EMBO Journal 12(2) 601-606, except using antibodies raised to EPSPS. PVDF 
membranes containing the transferred electrophoresed protein were incubated in a 
blocking solution of PBS buffer containing 0.05% Tween-20 (PBS-T) and 5% milk 
overnight at 4°C. The membranes are then incubated in a solution of PBS-T containing 
1 % milk and a primary antibody raised in goats to the CP4 EPSPS for 2 hours at room 
temperature. The membranes are washed three times in a solution of PBS-T containing 
0.1% milk, each wash for 5 minutes at room temperature. The membranes are then 
mcubated in a solution of PBS-T containing 1% milk and sheep anti-goat antibody for 1 
hour at room temperature, and washed again in PBS-T containing 0.1% milk, three times 
for 10 minutes at room temperature. A final wash using only PBS-T is performed before 
developing the membranes using a nonradioactive detection kit (ECL, Amersham), 

The results are shown in Table I set forth below. As shown, expression constructs 
containing the N-terminal 14 amino acid portion from GFP demonstrated significantly 
higher levels of protein expression. Transplastomic hnes containing either pMON45259 
or pMON49218 demonstrated total soluble protein of greater than 12% CP4 EPSPS. In 
contrast, the constructs without the 14 amino acids produced between 0.001 and 0.2% of 
total soluble protein as EPSPS. 

In addition, other transplastomic lines were analyzed for tolerance to spraying with 
various levels of glyphosate. Homoplasmic tobacco plants are sprayed with glyphosate 
using a track sprayer at concentrations corresponding to 0 ounces/acre, 16 ounces/acre, 32 
ounces/acre, 64 ounces/acre and 128 ounces/acre to test for whole plant tolerance. Plant 
height was measured before and after spraying with glyphosate. The vegetative injury data 
was collected two weeks after spraying, while the reproductive injury data was collected at 
plant maturity. Tolerance is characterized by the continued growth and greening of tissues 
sprayed with glyphosate. Specific activity is measured as the amount of exogenously 
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added Phosphoenol pyruvate (PEP) converted to Shikimate-3-phosphate (S3P) per unit 
protein in the plant extract. The addition of glyphosate tests sensitivity of the EPSPS 
enzyme to glyphosate. The results are summarized in Table L 

Table I 
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As shown, the constructs with the 14 amino acids from GFP produced 
transplatomic plants with higher levels of glyphosate tolerance. In particular, lines 
pMON45259 and pMON49218 are shown to provide tolerance to glyphosate applied at 
levels of at least approximately 128 oz/acre on vegetative tissues, and at least 
approximately 64 oz/acre on reproductive tissues. 

All publications and patent applications mentioned in this specification are 
indicative of the level of skill of those skilled in the art to which this invention pertains. 
All publications and patent applications are herein incorporated by reference to the same 
extent as if each individual publication or patent application was specifically and 
individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understandmg, it will be obvious that 
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certain changes and modifications may be practiced within the scope of the appended 
claim. 
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CLAIMS 

What is claimed is: 

5 LA construct comprising the following components in the 5' to 3' direction of 

transcription: 

a promoter region; 

a first DN A sequence encoding at least the first five amino acids of the following 
sequence: Gly Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ser Met 
10 a second DNA sequence encoding a protein of interest; and 

a transcription termination region. 



2. The construct according to claim 1 wherein said first DNA sequence 
encodes at least fourteen acids. 

3. The construct of claim 1 wherein said second DNA sequence encodes a 
gene which confers tolerance in a plant cell to at least one herbicide when said second 
DNA sequence is transcribed in said plant cell, 

20 4. The construct according to claim 1, wherein said construct further 

comprises a gene encoding a selectable marker. 



5. The construct according to claim 1, wherein said promoter is functional in a 
plant plastid. 

6. The construct according to claim 5, wherein said construct further 
comprises a ribosome binding site joined to said promoter. 

7. The construct according to claim 6, wherein said ribosome binding site is 

3 0 from a leader sequence selected from the group consisting of sites derived from plastid, 

bacterial or bacteriophage leader sequences. 

8. The construct according to claim 7, wherein said ribosome binding site is 
selected from the group consisting of the binding site of the gene 10 leader and the 

35 rbcLRBS site. 

9. The construct according to claim 3 wherein said second DNA sequence encodes 
a gene which confers tolerance to the herbicide glyphosate. 

40 10. The construct according to claim 9 wherein said second DNA sequence 

encodes a glyphosate-tolerant 5-enolpyruvylshikimate-3-phosphate synthase. 

11. The construct according to claim 10 wherein said second DNA sequence is 
selected from the group consisting of the E. coli or Salmonella aroA gene, the CP4 gene, 

4 5 mutant petunia EPSPS, mutant EPSPS gene of Psuedomonas strain LB AA, and the 

Bacillus subtilis aroE gene. 
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12. The construct according to claim 1 wherein said second DNA sequence is 
encodes a gene selected from the group consisting of a imidizalinone-tolerant AHAS gene, 
the ALS gene, a phosphinothricin-tolerant gene, the BAR gene, an enzyme of the 
carotenoid pathway, the crti gene, and a bromoxynil-tolerant gene. 

5 

13. A cell containing the construct according to claim 1. 

14. The cell according to claim 13 wherein said cell is a plant cell. 

10 15. The cell according to claim 13 wherein said cell is a plant plastid. 

16. The ceU according to claim 13 wherein said cell comprises at least 
approximately 12 % of total soluble protein as a protein expressed from said second DNA 
sequence. 
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17. A cell containing the construct according to claim 11. 



18. The cell according to claim 17 wherein said cell comprises at least 
approximately 12 % of total soluble protein as a protein expressed from said second DNA 

20 sequence. 

19. A plant, plant seed, plant cell or progeny thereof containing a plant plastid 
according to claim 15. 

25 20. A plant, plant seed, plant cell or progeny thereof containing a plant plastid 

according to claim 17. 

21 . The plant according to claim 20 wherein said plant is tolerant of the 
herbicide glyphosate when said herbicide is applied at a rate selected from the group 

3 0 consisting of at least about 64 ounces per acre and at least about 128 ounces per acre. 

22. A method for enhancing protein expression, comprising: 
transforming a cell with a construct comprising the following components 

in the 5* to 3' direction of transcription: 
35 a promoter region; 

a first DNA sequence encoding at least the first five amino acids of the following 
sequence: Gly Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ser Met 
a second DNA sequence encoding a protein of interest; and 
a transcription tennination region; and 
40 growing said cells under conditions wherein said first and said second DNA 

sequences are transcribed. 

23. The method according to claim 22 wherein said first DNA sequence 
encodes the fourteen amino acids of the sequence. 

45 

24. The method according to claim 23 wherein said promoter is functional in a 
plant plastid. 
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25. The method according to claim 24 wherein said second DNA sequence 
encodes a gene which confers tolerance to the herbicide glyphosate. 

26. The method according to claim 25 wherein said second DNA sequence 
5 encodes a glyphosate-tolerant 5-enolpyruvylshikimate-3-phosphate synthase. 

: -27. The method according to claim 25 wherein said second DNA sequence is 
selected from the group consisting of the E. coli or Salmonella aroA gene, the CP4 gene, 
mutant petunia EPSPS, mutant EFSPS gene of Psuedomonas strain LBAA, and the 
10 Bacillus subtilis aroE gene. 

28. The method according to claim 23 wherein said second DNA sequence is 
encodes a gene selected from the group consisting of a imidizalinone-tolerant AHAS gene, 
the ALS gene, a phosphinothricin-tolerant gene, the BAR gene, an enzyme of the 

1 5 carotenoid pathway, the crti gene, and a bromoxynil-tolerant gene. 

29. A method of producing tolerance of a herbicide in a plant cell, wherein said 
method comprises: 

transforming said plant cell with a construct comprising the following as operably 
2 0 joined components in the 5' to 3' direction of transcription: 

a promoter functional in a plant cell; 

a first DNA sequence encoding at least the first five amino acids of the following 
sequence: Gly Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ser Met 

a DNA sequence which is capable of conferring in a plant cell tolerance to at least 
2 5 one herbicide; and 

a transcription termination region, and 

growing said plant cell under conditions wherein said DNA sequence is transcribed 
whereby a plant cell is rendered tolerant to applications of said at least one herbicide 
compound. 

30 

30. An herbicide tolerant plant cell produced according to the method of claim 29. 

. 3 1 . A plant, plant seed or plant part comprising a plant cell according to claim 30. 

35 32. A plant cell according to claim 30 and comprising greater than about 12 % of 

total soluble protein as a protein expressed from said herbicide-tolerance gene. 

33. A plant cell according to claim 30 wherein said herbicide-tolerance gene is a 
glyphosate-tolerant 5-enolpyruvylshikimate-3-phosphate synthase. 

40 

34. A plant; plant seed or plant part comprising a plant cell according to Claim 33. 

35. A plant according to claim 34 tolerant of the herbicide glyphosate when said 
herbicide is applied at a rate selected from the group consisting of at least about 64 ounces 

45 per acre and at least about 128 ounces per acre. 
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(57) Abstract: Constructs which include a translational fusion of 14 amino acids derived from the green fluorescent protein are 
provided for the enhanced expression of proteins. Methods for the use of such constructs are also provided. 
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